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Abstract 



This paper focuses on the problem of the estimation of the cumula- 
tive hazard function of a distribution on a general complete separable 
metric space when the data points are subject to censoring by an ar- 
bitrary adapted random set. A problem involving observability of the 
estimator proposed in [8] and [9] is resolved and a functional central 
limit theorem is proven for the revised estimator. Several examples 
and applications are discussed, and the validity of bootstrap methods 
is established in each case. 
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1 Introduction 



In two recent papers, [8j and [9], the problem of survival analysis on a gen- 
eral complete separable metric space was approached from the point of view 
of set-indexed martingales (cf. [6]). The goal of these papers was to build 
a mathematical structure that would handle data on general spaces (includ- 
ing, but not limited to (i- dimensional Euclidean space) subject to very general 
types of censoring mechanisms. In particular, the cumulative hazard func- 
tion was defined on a class of sets, and a corresponding Nelson-Aalen-type 
estimator was proposed. 

In [8j, the censoring mechanism in a set-indexed framework was defined by 
introducing the concept of a stopping set, a random set with a particular kind 
of measurability. The stopping sets acted as windows, since we could observe 
the data points only through them, and they permit the consideration of more 
sophisticated censoring schemes than the usual multivariate generalizations 
of right-censoring; however, the stopping sets were still restricted to certain 
shapes. 

The theory of stopping sets was expanded to more general random sets 
called anti-clouds in [9]. The name is inspired by the example of aerial 
photography, where pictures are taken from high in the air and clouds may 
interfere with the observation of whatever the subject of interest is. Anti- 
clouds, then, are the complement of the clouds: the regions where we can 
actually observe the data points. The difference between the stopping sets 
and the anti-clouds is, quite simply, that anti-clouds have virtually no re- 
strictions in the shape they can take; obviously, this makes for much more 
realistic censoring schemes. 

The measurability requirement imposed on these random sets allow one 
to apply the theory of set-indexed martingales as developed in [6] to produce 
a set-indexed Nelson- Aalen estimator for the cumulative hazard on a general 
complete separable metric space in the presence of generalized censoring. In 
the case of censoring by stopping sets, consistency and asymptotic unbiased- 
ness of the estimator, as well as asymptotic normality of its finite-dimensional 
distributions, are proven in The Nelson-Aalen estimator in the presence 
of censoring by clouds presented more of a challenge, due to the general ge- 
ometric nature of the clouds; nevertheless, the estimator was shown to be 
consistent and asymptotically unbiased in [9]. 

Unfortunately, there were still some gaps in the general theory. Aside 
from the lack of a functional central limit theorem for the general estimator 
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of [0], there remained a critical problem limiting the applicability of the 
estimator. As will be seen subsequently, a process needed to construct the 
Nelson-Aalen estimator may not be observable under some common data 
structures. In practice, this could render the estimator unusable much of the 
time. 

Our first goal is to address the observability problem and produce a work- 
ing estimator for the cumulative hazard, while still preserving the general 
censoring model of [9]. It happens that the root of the observability prob- 
lem lies in the measurability requirement of the anti-clouds. By asking for 
a slightly weaker measurability condition, it is possible for us to achieve our 
objective. We call the new kind of random set that came from this modifi- 
cation a *-anti-cloud. If the complements of these sets, the *-clouds, act as 
the censoring in the survival model, then we are always able to construct a 
Nelson-Aalen estimate that can be observed. 

Our second goal is to prove a functional central limit theorem for the 
Nelson-Aalen estimator under generalized censoring on Euclidean space. In 
this case, we can also establish the validity of bootstrap procedures. 

It should be pointed out that various estimators for the cumulative haz- 
ard have been proposed for bivariate data under right censoring. However, 
for the most part they have been designed en route to producing a Kaplan- 
Meier-type estimator for the survival function. Unfortunately, in higher di- 
mensions the relationship between the survival and the hazard functions is 
not as straightforward as it is in one dimension. In two dimensions, for ex- 
ample, the survival function is determined by the hazard and both marginal 
distributions. For a good exposition of the representation of a bivariate sur- 
vival function 5* in terms of the hazard and the marginals, we refer the reader 
to [S] or [T^. Usually, one-dimensional Kaplan-Meier estimates are used for 
the marginal survival functions; what will change is the way the hazard is 
estimated. As Kalbfleisch and Prentice explain in ^lOj, the simplest of this 
type of estimator is due to Bickel, who uses the ratio of the number of failure 
points that were uncensored in both components at (ti, ^2) to the number of 
pairs at risk at that point. The Dabrowska estimator and the Prentice-Cai 
estimator represent attempts at improving this approach, by considering not 
only the ratio of the number of double failure times to the at-risk set, but also 
the ratio of the number of failures on one coordinate when the other is still 
alive to the at-risk set. Lastly, Pons proposed a martingale-based estimator 
of the cumulative hazard of two right-censored survival times in [TT], which 
she then used to produce a test for independence of the two survival times 
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by comparing it to the product of the usual one-dimensional Nelson-Aalen 
estimates of both marginal cumulative hazards. However, her model required 
independence of the components of the survival time. 

All of the preceding estimators depend strongly on the structure of Eu- 
clidean space and as well on the censoring mechanism, which is generally 
assumed to be right censoring of each component of the data point sepa- 
rately. The advantage of the martingale methodology used here is that it 
provides a unified and versatile approach to hazard estimation; in particular 
it is applicable to both Euclidean and non-Euclidean spaces, and may be 
applied to very general censoring mechanisms. 

We will proceed as follows. In ^ we provide the mathematical frame- 
work for our model. We develop the notion of the *-anti-cloud and show 
that certain martingale properties are preserved under filtering by this more 
general censoring mechanism. In ^ the observability problem of the origi- 
nal Nelson-Aalen estimator is discussed. The results of [|2] allow us redefine 
the Nelson-Aalen estimator of the cumulative hazard under censoring by *- 
clouds. We present several examples to illustrate how our new estimator can 
be used to circumvent the observability problem through a transition from 
the anti-cloud model to the *-anti-cloud structure. The price we pay is a 
possible loss of previously uncensored data that will need to be discarded. In 
^ we restrict our attention to Euclidean space, and show that the Nelson- 
Aalen estimator satisfies a functional central limit theorem. These results 
are applied in ^ to show how the Nelson-Aalen estimator can be used to 
develop various tests involving the dependence structure of the underlying 
distribution. The validity of bootstrap methods is established in each case. 

Most of these results can be found in additional detail in [2] (the doctoral 
thesis of the first author), where estimation of the survival function is also 
considered. 

2 Framework and Definitions 
2.1 The Set-up 

The framework we will be using is very similar to that used in [B], [S] and 
[H|, except for some details. Let T be a compact complete separable metric 
space and A a finite measure on B, the Borel sets of T. A simple example 
is T = [0, l]'^, and we will return to this space in the sequel. Preserving 
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the notation in [HI, if I' is an arbitrary class of sets, then T>{u) will denote 
the class of finite unions of sets from V. For D an arbitrary subset of T, 
let Td be a countable dense subset of D. We will use 'c' to indicate strict 
inclusion; moreover, D denotes the closure and D° denotes the interior of D. 
All processes will be indexed by sets belonging to an indexing collection A: 

Definition 2.1 A nonempty class A of compact, connected subsets of T is 
called an indexing collection if it satisfies the following: 

1. 0, T G A, and for A E A, A° ^ A if A ^ (J) or T. 

2. A is closed under arbitrary intersections and if A, B E A are nonempty, 
then A n B is nonempty. Denote 0' = n^g_4^^^0yl. // (Ai) is an in- 
creasing sequence in A, then UjA, G A. 

3. The a-algebra generated by A, (^{A) = B. 

4- Separability from above: 

There exists an increasing sequence of countable subclasses An 
= {A^, A2, ...} of A closed under intersections and satisfying 0,0' G 
An{u). As well, there exists a sequence of functions Qn '■ A ^ An{u) 
such that 

(a) Qn preserves arbitrary intersections and finite unions (i.e. 
gni^AeA'A) = ^AeA'gn{A) for any A' C A, and if Uf^^Aj = 
U™ 1^;., then utigniA,) = Uf^,gn{A'^) ) , 

(b) for each A E A, A C (g^iA))" , 

(c) gn{A) C g„,{A) ifn>m, 

(d) for each A E A, A = nnfi'n(A) , 

(e) if A, A' E A then for every n, gn{A) Ci A' E A, and if A' E An 
then gniA) n A' E An- 

(f) ^?n(0) = Vn. 

Definition 2.2 For t E T, the 'past' oft is the set At := fl^e^, teA^ ■ 

We observe that the indexing collection A induces a partial order on T: 
s < t if and only if As ^ Af. In other words. At = {s : s < t}. We can now 
define the 'future' of t G T to be the set Et = {s : s > t}. It will be assumed 
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that Et is closed. In terms of this partial order, 0' is the minimal element of 
T. 

The following definitions can be found in [9j: C is the class of all subsets 
of T of the form 

C = A\B, AeA, Be Aiu). 

C is closed under intersections and any set in C{u) may be expressed as a 
finite disjoint union of sets in C; in other words, C is a semi-algebra. 

For each Ak, let C^{Ak) denote the left-neighbourhoods of Ak- that is, 
all nonempty sets of the form C = A \ Uyi'e^fc.^gA' ^ ^ -^k- It is clear 
that C^{Ak) partitions T, and as observed in [S], the sequence {C^{Ak))k is a 
dissecting system for T. 

Example 2.3 As an example of the type of structure we will be using, con- 
sider T = [0, 1]^. Then the class A = {[0,t] : t e T} satisfies the conditions 
of Definition 12.11 and C{u) consists of all finite unions of disjoint rectangles 
of the form (s, t] for s, t G T. 

For T = [0, If and AeT,A will be called a lower layer if [0, t] C A, Wt e 
A. The class A = {A C T : A is a lower layer} is also an indexing collection. 
In both cases we have that At = [0,t] and Et = [t,l], and both indexing 
collections induce the usual partial order in [0, 1]^. 

It is important to note that we can define other indexing collections that 
induce different partial orders. Example 2.4 in [8] illustrates this point. 

We will require some extra assumptions. The first one is about the struc- 
ture of C, while the second one is concerned with the sub-semilattices Ak'- 

Assumption 2.4 If C e C, then there exists a representation C = A \ 
U™iA (A,Di,...,D^ G A) such that %f A' e A, A' f] C = 0, then A' C 
U™i-Di. This is called a maximal representation of C. 

Assumption 2.5 For each C G C^{Ak), there exists a point tc- G T such 
that C C Et^_ , and 

lim sup sup ^) = 0, 

where d is the metric on T . Moreover, tc- can he chosen so that tc- G dC . 

The point tc- acts as a lower bound on the points in C: tc- < t \ft E C . 
For example, \i A = {[Q,t\ : t e T} and C = (t, t'], then tc- = t. 
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Definition 2.6 Let (f2,jF, P) be any complete probability space. A filtration 

indexed by A is a class of complete sub-a-fields of T , {T a '■ A G A) such 
that 

• IfACB, then Ta ^ Tb VA, B ^ A. 

• -^nA, = n^A, for any decreasing sequence (Aj) in A {Monotone outer- 
continuity). 

We continue to follow the definitions in [9]. If i? G A{u)., then T% = 
\l A&A, AciB^A- The (j-algebras {JF^ : B G A{u)} are complete and in- 
creasing, but they may not be monotone outer-continuous. To avoid this 
problem, we can define J^b = ^n^^g„{B) f'^^ ^ ^ A{u). For C G C{u) \A, let 
G*c = y BdAiu), Bnc=0 ^B, and for A G ^, A 7^ 0, define Ga = -^0- Note that 
{Qc} is ^ decreasing family of a-fields: if C C C', then C Finally, 
if C = A \ i? is a maximal representation of C, then = Tb- This was 
shown in [6]. 

Definition 2.7 ^c/. /P/;.- 

• A (A -indexed) stochastic process X = {Xa '■ A G A} is a collection 
of random variables indexed by A, and is said to be adapted if Xa is 
J-' A-measurable for every A & A. 

• X is said to be integrable if E[\ Xa |] < cxo \/A G A. 

• A process X : A ^ H is increasing if for every u & Q, the function 
X.{uj) can be extended to a finitely additive function on C satisfying 
X0/(a;) = and Xc{uj) > 0, VC G C, and such that X.^u) is mono- 
tone outer- continuous (i.e. if (An) is a decreasing sequence of sets in 
A{u) such that n„y4„ G A{u), then lim„XA„(ci;) = Xr,^A„{^))- (It is 
clear that any trajectory of an increasing process can be extended to 
a measure on B. Note that an increasing process is not necessarily 
adapted.) 

• An integrable process M = {Ma, A G A} is called a strong martingale 
if it is adapted and for any C E C, E[Mc\Q*c\ = 0. If the process M is 
not adapted, it will be called a pseudo-strong martingale. 

• A process X is called a *- compensator of the process X if it is increas- 
ing and the difference X — X is a pseudo-strong martingale. 
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Before moving on to adapted random sets, we should note that ahhough 
we are restricting ourselves to a compact set T, all of the definitions and de- 
velopments throughout this work can easily be expanded to a locally compact 
space using the structure found in Definition 2.1 of [8j. 

2.2 Clouds and *-clouds 

We start by giving the definition of an adapted random set, a notion that 
was first introduced in [9]. Recall that a closed set D is a domain if D = D°. 
As in [9], let /C be the class of domains D in T such that X{dD) = 0, where 
A is the measure defined at the beginning of Section \'2.1\ and let £ be the 
class of open sets that are complements of sets in /C. 

Definition 2.8 A random set rj : Q ^ B is an adapted random set if for 
any t eT, {cu : t e r]{ij)} E T At- 

• An adapted random set p taking its values in C is a cloud. 

• An adapted random set ^ taking its values in K, is an anti-cloud. 

We now define a new kind of random set, but before doing so we need 
another assumption. 

Assumption 2.9 Define Dt = E^. Then Dt E A{u) \f t eT. Also, if (t„) 
is a sequence such that t <tnWn and tn [ t, then Dt = r\ Dt„ ■ 

This is crucial to some later developments, since = T \ Dt and so we 
will have that E C. We may interpret E^ as the 'strict' future of t, and 
we shall define the 'wide' history of t to be J-'^ := Q*^o = J^Dt- 

Now we are in position to define *-adapted sets: 

Definition 2.10 A random set rj : Q ^ B is a ^-adapted random set if for 
any t E T, {cu : t E i]{uj)} E T*. 

• A *-adapted random set p taking its values in C is a *-cloud. 

• A ^-adapted random set C, taking its values in K, is a *-anti-cloud. 
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The following theorems were proven in [5] for anti-clouds. We give the 
statements for *-anti-clouds; the proofs are analogous to their anti-cloud 
counterparts. For details, see [2]. We recall from the definition of an in- 
creasing process that if X is increasing, it can be extended as a measure 
to B. Therefore, X^{uj) := X^(^)(a;) is well-defined for any anti-cloud or 
*-anti-cloud. 

Theorem 2.11 Let X he an increasing process (or the difference of two 
increasing processes) on A and suppose that ^ is a *-anti-cloud. Then for 
any A E A, both Xah^ and XAndi ^'^^ random variables. 

Now we deal with the filtered process := X^nA- 

Theorem 2.12 Let ^ be a ^-anti-cloud and X = Y — W , where Y and W 
are increasing processes such that Yq^ = Wg^ = a.s. 

1. If X is a (pseudo)- strong martingale, then X^ is a pseudo-strong mar- 
tingale. 

2. X^ will not generally be adapted, even if X is. 



3 Hazard Estimation and Observability 
3.1 The Nelson- Aalen Estimator 

The model is the same as the one in [8] and [9]. Assume that P) is a 

complete probability space. Let Y : VL ^ T he a. T- valued random variable, 
and /i(-B) = P{Y G B} its distribution. The survival function associated 
with Y is S{t) = fi{Et). We assume that /i is absolutely continuous with 
respect to A (A as in Q and denote by fi the Radon-Nikodym derivative of 
fi with respect to A on the Borel sets of T. Finally, we will assume throughout 
this section that the indexing collection is ^ = {A^ : t G T}. 

Definition 3.1 • Fort G T, the hazard function ofYish where 

hit) = ^ 

"^^^ s{t) • 

If S{t) = 0, h{t) is defined to be zero. 
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• The integrated hazard function ofY is H where 

Ha = / h{u)\{du) for any A E A. 

J A 

Let = {Na,A G A} = {Ijy^a},A G A} be the single jump process 
associated with Y and JF^ = {J^^,A G A{u)} its minimal filtration: JF^ = 
(r{NB : B e A,B C A}U {Vq}, where Vq is the class of P-nuU sets. This 
filtration is in fact outer-continuous, as was proved in [7j. iV is increasing, 
and it was proved in [B] that it has a ^-compensator: 

Proposition 3.2 (^8], Proposition 2.9) The process N defined by 

Na= ii{Eu)~^ li{du) = / h{u)X{du) = / J{ye£;„}/i(u)A((in) 

JAnAy JAnAy J A 

is a * -compensator of the process N with respect to its minimal filtration, 
where Ay{uj) = Ay(^). 

Suppose we have a T-valued random variable Y whose associated single 
jump process is adapted to a filtration and an jF-*-cloud p with cor- 
responding >K-anti-cloud ^ = p'^. The filtered jump process A^.^ := /{yg^n } 
corresponds to observing occurrences of Y only on the complement of the 
*-cloud; then we can say that N has been filtered by the *-cloud p. 

Example 3.3 This example shows that our framework includes the usual 
bivariate censoring model, as presented in pjj. We assume that T = [0.1]^ 
(or any bounded rectangle in R^) and let A denote Lebesgue measure. Let 
Y = (Yi, Y2) G [0, 1]^ be a two-dimensional failure time on a probability space 
P), and suppose that JF satisfies J-'J C JF^ for every t G [0,1]^. Let 
t = itiM) e [0,1]2, ^1;) = yt,^{t,M)^ = Vii^(ti,t2), and let r = {n,T2) 
be a two-dimensional censoring time, where is an jF*^*^-stopping time for 
i = 1,2. 

Suppose we can observe YAr = (Yi Ari, Y2AT2) and /{y-<T-j, i = 1,2. Now 
we can express everything in terms of sets, letting At = [0,t], and ^ = [0,r]. 
^ is a *-anti-cloud, since 

{t G O = {(^1,^2) G [0,ri]x[0,r2]} = {t, < n}n{t2 < ra} G H'^®.f1? C 
Finally, the counting process of censored times is 

N^(t) = N^^ = I{YAT<t,Yi<ri,Y2<r2} = ^{YGAtH^}- 

□ 
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In what follows, if ^ : f2 — > /C, let JF^ denote the minimal (^-indexed) 
filtration with respect to which ^ is a *-anti-cloud. 

Definition 3.4 Let Y be a T -valued random variable and let J^^ be the mini- 
mal filtration generated by its associated jump process N . Let T be a filtration 
such that T\ C T a G A{u) and let ^ be an J-'-* -anti- cloud. ^ is 

1. weakly independent of Y if the *-compensator of N with respect to T 
is the same as the -^-compensator with respect to ; 

2. independent of Y if is independent of and T a = V T\, 
VA G A. 

The following lemma is an immediate consequence of Theorem I2.12[ 

Lemma 3.5 Suppose that is a *-anti-cloud, that Y and ^ are weakly inde- 
pendent and that the filtration T satisfies T a = ^\ V JF^, \j A ^ A. If N is 
the single-jump process associated with Y , then {N — NY is a pseudo-strong 
(T-)martingale. 

Now we are able to define the Nelson-Aalen estimator of the integrated 
hazard function using filtered data. We will use the same definitions as in 

Henceforth, we assume that we have a sequence of i.i.d. T-valued random 
variables (Fj) with the same distribution as F, as well as a sequence (^j) of 
*-anti-clouds independent of the F/s. Define the following processes: 
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By independence and Lemma I3.5[ is a ^-compensator for A^^") , and 
the process 

M(") = Ar(")^ - J Zn{t)h{t)X{dt) (2) 
12 



is a pseudo-strong martingale with respect to JF, the minimal filtration gen- 
erated by the sequences (Yj) and (^j). Since 

Ar(")^(dt) = Zn{t)h{t)X{dt) + M^''\dt), 

regarding M*^") as noise, we come to a set-indexed version of the Nelson- Aalen 
estimator for Ha'- 

"'^'-L^- S (3) 

We observe that H^^ — Ha = J a ^{t)'' ' following is analogous to Propo- 
sition 4.5 of [9j; for details, see [2j. 

Proposition 3.6 i^^^") — H is a pseudo-strong martingale. 



3.2 The Observability Problem 

The Nelson-Aalen estimator defined in ([3]) is identical to that introduced in 
[9], with the exception that in [9J it was assumed that ^ is an anti-cloud. 
We will now explain the reason for incorporating the more general censoring 
mechanism (*-clouds) into the survival model. 

Example 3.7 Ideally, we would like to have all the information regarding 
the events in JF^ at time t. Unfortunately, this is generally not possible 
in practice, and we have to settle for somewhat more limited information. 
Specifically, if ^ is a fully observable anti-cloud, typically we may only be 
able to observe 

Ht = 0-{/{yeAnC}, ■ ^ ^ A, s e Dt} CJ^t- 

We need the event I^YeEt}^{teO to be observable in order to construct the esti- 
mator for the integrated hazard function. But the problem is that Ij^Yf^Etj^ite^} 
is not Tit-measurable, and so the estimator cannot be used. Indeed, suppose 
t E ^ and DtdC,^ ^ 0. If Y is not observed in Dt fl ^, it is impossible to know 
whether Y E Df (1 C,'^ or Y E E^. This situation is illustrated in Figured] for 
T c R^. 

To correct this, we can define $,*{^) = {t '■ Dt H ^"^{uj) =0}. ^* is a 
*-anti-cloud, since {t e C} = {Dt n = 0} = {A ^ O = HseDti^ e 
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Figure 1: The observability problem: we don't know whether Y G Et. 

C,} = riseTojI-s ^ C,} ^ 'Ht J-'l, recalling that Tq^ stands for a countable 
dense subset of Df. Figure [2] pictures the *-anti-cloud ^* corresponding to 
the anti-cloud ^ in Figure [H 

Now consider I^YeEt}^{te^*}- We have that if t G then Dt C ^, and so 

{rG^t}n{tGr} = {YeDtYn{teC} 

= {¥ eDtn^Yn{tee}^nt. 

This means that I{Y£Et}I{t(^i*} is 7-^rnieasurable and hence observable, and 
now we are able to calculate the estimator. 

Of course, there are instances where it is not possible to observe I{te^} 
whenever the observation Y happens before time t; in other words, when the 
anti-cloud is not fully observable and Y & At. In this case, the information 
available up to time t consists of 

Ht = cr {I{YeAn(}, I{se^nE^} ■ A C Dt, s E Dt} C J^*; 

nevertheless, we can still observe the event I{Y(zEt}I{te^*}- indeed, using the 

fact that {Y G Et} = Hs^r^ois G E^} and {t G T} = Hs^T^ois G we 

t t 

have that 

{Y eEt}n{teC}= f] ({^ e ^y} n {s g O) = fl {seE^n^e^t. 

t t 
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Figure 2: The resulting *-anti-cloud 



This example demonstrates that an estimator can always be constructed, 
but in some sense is a worst-case scenario: we observe that the move from 
^ to ^* as defined above may entail articially censoring observed values of Y 
that lie in ^\^*. Therefore, in practice, we should examine each information 
structure closely to determine whether the Nelson-Aalen estimator can be 
constructed using the cloud C, (cf. Example 13. 101) . and if not, how best to 
define an appropriate anti-cloud that censors as few observations as possible. 
This will be illustrated in Example 13. 9[ □ 

Example 3.8 We now consider a generalization of Example 13.31 Again, 
assume that T = [0,1]^ and A = Lebesgue measure. Let Y = (Yi,Y2) G 
[0,1]^ be a two-dimensional failure time on a probability space (fi,jF, P), 
t = {ti,t2) G [0, 1]2 with C for every t G [0, 1]^ and J^JJ^ as in Example 
13. 31 for i = 1,2; but instead of having a single two-dimensional censoring time, 
we will consider a finite sequence of jF^*^-stopping times = rjn < un < rii2 < 
z/j2 < . . . < ijini < Uin, for 2 = 1, 2. Let s = (si,S2) and u = {ui,U2) and 
suppose we can observe 

= {^{y.<s„y.eU;iifei,^..]}'^{^»6K-.^«..+i)-'7»,.+i<«J : s,ueAt; i = 1,2}. 

An illustration of this kind of situation can be seen in Figure El This sce- 
nario could arise in a laboratory, where test animals are under continuous 
observation during the day, but not at night. 



15 



'723 
1^22 





1 1 1 
1 1 1 




1 1 1 1 1 
1 1 1 1 1 






-\T\- 




n n r 






1 1 1 
1 1 1 
1 1 1 
1 1 1 
1 1 1 
1 1 1 




1 1 1 1 1 
1 1 1 1 1 
1 1 1 1 1 
1 1 1 1 1 
1 1 1 1 1 
1 1 1 1 1 






+ 14 14 1- 





14 14 1- 
14 14 1- 
14 14 1- 

1 1 1 1 1 






1 1 1 
1 1 1 
1 1 1 




1 1 1 1 1 
1 1 1 1 1 






J^ll ?7l2 1^12 113 

Figure 3: We can observe the failure times in the unshaded area. 



Let r = {\JT=i[ni3, i^ij] X U"ii[r?2,-, iy2j]}. We have that 

ni Ji2 

i=i i=i 

ni 712 

= {^1 e U [vij, ^ij]} n {^2 e U [ri2j, i^2j]} 

ni n2 

= ( U {^1 e [vij , ^ij]}) n ( U {^2 e [r)2j , 1^2^]}) 

so ^* is a >K-anti-cloud. Then I{YeEt}^tei*} can be observed and we get our 
estimator for the integrated hazard function. □ 



Example 3.9 Once more, assume that T — [0, 1]^ and A = Lcbcsgue mea- 
sure. Suppose now that we have a sequence of i.i.d. T-vahicd random vari- 
ables (Fj) = (^1,4, l2,i) with the same distribution as Y, as well as a sequence 

of i.i.d. [0, l]-valued random variables Ki^i < K2,i for i — 1,2, Define 

the filtration J^'^^''^^ = cr{-^{«i,i<si}, -^{«2,i<si} '■ si < h}, and let J^it^^) — 

Hi,t,)'^K""^ as well as T^'^ = yt,^{h,t,), J"?^ = Vt^^M- Let := 
{(^1,^2) : < ti < K2,i, ti < t2 < ti + c} represent the censored region, 
where c denotes a constant. Then it is easy to see that pi is a sequence of 
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Kl K2 

Figure 4: The regions p and p*. 



*-clouds, since the stopping times. This kind of data structure can 

arise when, for example, Yi is the age of start of pregnancy and Y2 is the age 
of onset of a disease such as tuberculosis. At ki, it is found that the test 
used for diagnosing the disease is dangerous for a pregnant woman, so once 
the pregnancy starts it becomes impossible to diagnose that disease for the 
next nine months. At time K2 though, a new safe test becomes available and 
the times of onset of the disease are no longer censored. See Figure |H 

In this case, we would censor the pairs (^1,4, ^2,2) on the extended region 
C = P*i ■= {(tiM) ■■ < h < K2,i, h < t2 < K2,i + c} U {{ti,t2) : 
ti < Ki^i, Kl j < ^2 < f^2,i + c} in order to be able to observe the indicators 
I{Y&Et}I{t(^^*} needed to obtain the Nelson- Aalen estimator, as illustrated in 
Figure HI Note that p* is also a sequence of *-clouds, since all the endpoints 
of the intervals in the definition of each p* are stopping times with respect 
to both !F[]^ and J-'il^; hence 

m,t2) G p*} E j^ii^ ® j^if c V ^i? = j^i,^,^y 

The censored region defined by p* is clearly much smaller than the *-clouds 
of Example 13. 7[ and would result in less lost data. □ 

Example 3.10 There are examples that fit the original censoring model of 
^ when T = [0, 1]^ and A = Lebesgue measure. Suppose that ^ is a cloud 
which is a lower layer and that we can observe 

= {hy^<s^}^ hn<^o ■ s,u e At] i = i, 2}. 
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Since ^ is a lower layer, Ii^YeEt}km} = " ^{Y'i<ti})(l - ^{y2<t2})^{t65} ^ Ht, 
and the cloud ^ may be used in An application of this model to a medical 
data set in given in [3]. □ 



4 A Functional Central Limit Theorem 

In this section, for clarity of exposition we will assume that T = [0, 1]^ and 
A = Lebesgue measure, but all results can be extended to bounded rectangles 
in R'^. Our objective is to prove a functional central limit theorem for the 
T-indexed process 

We will use the functional delta method, for which [T3] is an excellent refer- 
ence. 

Throughout this section, we will make extensive use of the following no- 
tation. If X is an arbitrary set, then the Banach space /°°(X) is the set of 
all functions / : X — R that are bounded uniformly and equipped with 
the norm ||/|| = sup^ C[0, 1]^ denotes the continuous functions, and 

^[0, 1]2 is the Banach space of all functions / : [0, 1]^ — > R continuous from 
the upper right quadrant and with limits from the other quadrants. Both 
C[0, 1]^ and D[0, 1]^ are equipped with the uniform norm. Finally, BVm[0, 1]^ 
denotes the space of all functions in D[0, 1]^ with total variation bounded by 
M. Products of these spaces will always be equipped with a product norm. 

The next lemma will be used in the proof of the functional central limit 
theorem. It is a two-dimensional version of Lemma 3.9.17 in [13], and since 
the proof is similar, it will be omitted. 

Lemma 4.1 For each fixed M , the maps : /°^[0, 1]^ x BVm[Q, 1]^ — > R and 
V;:r[0,l]2x Wm[0,1]2^D[0,1]2, 

(f){A,B)= [ AdB, i){A,B){t)= I AdB (4) 

"'(0,1] J{o,t] 

are Hadamard-differentiable tangentially to C[0, 1]^ x D[0, 1]^ at each {A, B) 
in /°°[0, 1]^ X 5Vm[0, 1]^ such that J \dA\ < oo, and the derivatives are given 
by 

(I>'ab{c(^P) = [Ad(3+ [ adB, ij^'j^^ia, (3)(t) = [ Adp + [ adB, 

J J ' J(0,t] J{0,t] 
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where J Adj3 is defined via the two-dimensional integration by parts formula 
found in Theorem 8.8 of ^ if j3 is not of hounded variation. 

Before moving on to the central limit theorem, we have to make some 
assumptions that will allow us to apply the delta-method to our processes. 

Assumption 4.2 P(t E ^) is continuous in t E T and there exists e > 
such that P{t G > e for every t eT. For all s,t eT, P{s G - t G 
^ K \ s — 1\, where K is a constant and \-\ denotes the Euclidean norm. 

Assumption 4.3 The survival function S satisfies a Lipschitz condition of 
order 1 . 

Assumption 4.4 //^(r, r) > 0, then g{-) := (S'(-)P(- G 0)""^ ^■^ of hounded 
variation on [0, r 
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The following functional central limit theorem is the main result of this 
article. 

Theorem 4.5 Under Assumptions ^.^ and \4.3[ 

(^(^n(-) - E{Z^{.))), ^(iV^-^i - E{N(-)i))^ (A, r) 

where A and T are tight Gaussian processes on C[0, 1]^ and D[0, 1]^ respec- 
tively. Moreover, U^"^ =^ G in /°°[0,r]^ for every r such that S{t,t) > 0, 
where G is a mean-zero Gaussian process such that for t E T, 

^ r dTju) _ r A{u)h{u) 
* J[o,t] S{u)P{u G 'm S{u)P{u G 

The first term in Gt is defined hy integration hy parts. 

Before proceeding with the proof, we make a few observations and prove 
a lemma that will be required. The statement of the CLT is very similar 
to Example 3.9.19 in [TH]. One of the differences, obviously, is that we are 
working on two dimensions instead of one. However, the major difference 
resides in our censoring mechanism and the information we are provided 
with; a consequence of this is that Zn, the survivor function process, will not 
be in D[0, 1]^. This lack of sample path regularity necessitates Assumptions 
and 14.31 above, which yield the following lemma and its corollary: 
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Lemma 4.6 Under Assumptions \4-S\ and \4-3[ the sequence of processes 
Dn{t) '■= -^[Znit) — E{Zn{t))] convcrgcs in distribution in /°°[0, 1]^ to a 
tight Gaussian process A taking its values on C[0, 1]^. 

Proof: Let Wt = I{YGEt} = -^{te?}- Note that Assumption 14.21 implies 

that E \Vt - Vs\ <2K\t- s\, since 

Then we have that 

= E[E{\WtVt-WtVs\)\Wt,Ws] 

= E [\Vt - Vs\ \Wt = Ws = l] P{Wt = Ws = l) 

+ E[Vt\Wt = l,Ws = 0]PiWt = l,Ws = 0) 

+ E[Vs\Wt = 0,Ws = l]P{Wt = 0,Ws = l) 

< 2K\t- s\ S{s V t) + P{t G 0{S{t) - S{s V t)) 

+ P{seO{S{s)-S{s\/t)) 

< K*\t-s\, 

for some constant K*, where the first inequahty follows from the indepen- 
dence of the processes V and W and the observation at the beginning of this 
proof; the second is a consequence of Assumption 14.31 But by a nearly iden- 
tical argument to that in Example 2.11.14 in [T2j -we use two-dimensional 
blocks instead of closed intervals- this implies that the sequence of processes 
defined by 

in 1 

-^[T.hy.eE,}htei.} - nS{t)P{t G 0] = -^[Zn{t) - E{Z,,m = Z}„(t) 

converges in distribution to a tight Gaussian process A on /°°[0, 1]^. Conti- 
nuity of A is a consequence of the fact that E\Dn{s) — < Ki\s — t\ 
for some constant Ki < oo (cf [13], pg- 41): from Assumptions 14.2 1 and 14. 3( 
we have that 

E\D^{s) - D^{t)\^ = S{s)P{seO + S{t)P{teO-'2S{swt)P{s,teO 

-[S{s)P{s GO- S{t)P{t G 0]' 
= [S{s)P{s GO- S{s V t)P{s, t G 0] + [S{t)P{t G 
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-Sis V t)P{s, teO]- [S{s)Pis GO- Sit)Pit e 0? 

= [S{s) - Sis V t)]P{s G + [Sit) - Sis V t)]Pit G 

+Siswt)[PiseO-P{s,teO] 
+Sisyt)[PiteO-Pis,teO] 
-[Sis)PiseO-Sit)PiteO? 

< Ki\s-t\. 



□ 



Corollary 4.7 Under Assumptions \4 -SI and \4.3\ 

^"^^^ Sit)PiteO 



sup 

t6[0,l]2 







as n oo. 

We are now ready to proceed with proof of our main result. 

Proof of Theorem 14. 5t Since the ^j's are i.i.d., we have that the sequence 
of processes (C„) defined by 

1 " 

Cnit) := hy.^A^ni.} - nPiY e A,Y e 0] 

i=i 

converges in distribution to a tight Gaussian process F on D[0, 1]^. This 
follows simply from the CLT for empirical processes, since we are working 
with a subdistribution. Combined with Lemma [4. 6 [ this means that 

^[(Z.(-),iVHi) - iEiZ^i.)),EiN(-)i))] (A,r) 



n 



on /~[0, l]'^xD[0, 1]^ where (A, T) is a Gaussian process on C[0, l]'^xD[0, 1]^. 

Recalling ([2]), we note that the estimator depends on the pair ( — , ) 
through the maps 

iAB)^ij,B)^ J jdB. (5) 

It is a consequence of Lemma 1411 that the map ([5]) is Hadamard-differentiable 
tangentially to C[0, 1]^ x Z}[0, 1]^ on a domain of the type 
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{{A,B): J\dB\ < M,A> e} for given M and e > 0, at every point {A, B) 
such that 1/A is of bounded variation. Whenever t is restricted to an in- 
terval [0,r]^ such that S{t) > 0, by Assumption 14.41 and Corollary 14.71 the 

pair ^^^'^ ) is contained in this domain with probability tending to 1 for 
M > 1 and e sufficiently small. The derivative map is given by 



{a, (3) 



1/A)dl3 - j {a/A^)dB. 



Now we can apply the delta-method to conclude that 
where 

dViu) r A(u) 



Gt 



m S{u)P{u e AoA {S{u)P{u e 



dP{YeA^,YeO 



is again Gaussian. As in Lemma 14. the first term in the limiting process 
has to be defined by integration by parts, since F may not be of bounded 
variation. Finally, we observe that P(Y G At, Y E ^) = /^^ P{u G C,)dfi{u) = 
Ja, S{u)P{u G ^)h{u)du. 

The covariance structure for G will be given later in this section. □ 

En route to finding the covariance structure for G, we need the following 
proposition. 



Proposition 4.8 Under Assumptions \4-^ and \4-3[ if A is a Borel subset of 
[0, r]^ where S{t, t) > 0, then as n oo, 



- Ha) 



M^^^dt) 



I A S{t)P{t G i) 
Proof: For any Borel set A C [0,r]^, 



n 



1 



A Znit) 

n 



M^''\dt) 

1 \ M^'^^dt) 



+ 



A\Z^{t) S{t)P{teOj 
1 M^'^^dt) 
A S{t)P{t G ' ' 



(6) 
(7) 
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Using Corollary 14.71 we can show that converges in probability to with 
exactly the same argument as in the proof of Theorem 5.1 in [S]. □ 

Remark 4.9 Proposition 14. 81 gives us the central limit theorem for the finite- 
dimensional distributions of ^/n{H^^^ — Ha) over a more general class of Borel 
sets, since s{t)P{t<^i) ' ^^'^^/n^^^ normalized sum of n i.i.d. processes, 

as noted in [8]. This comes up short of giving us a functional CLT since we 
need to prove tightness. It is for this reason that we believe the use of the 
Delta-Method is a more elegant approach to this particular problem. 

The next proposition is identical to its counterpart in [9J (Corollary 4.7). 

Proposition 4.10 Let g he a continuous function on T . Then J g{t)M^"'\dt) 
is a pseudo-strong martingale and for C,D & C, 

Cov ^J^g{s)M^'^\ds), J^g{t)M^-\dt)^ (8) 

g\t)S{t)P{t e Oh{t)dt 



n 



+ / / g{s)g{t)iJi{E,nEt)P{s,tei)h{s)h{t)dsdt 

J Jl{C,D) 

where 

1{C,D) = {{c,d) eC X D -.ce A^^nE^^} 
= {{c,d) eC X D : de A^^nE^^}. 

Now that we have Propositions 14.81 and I4.10[ we are in position to give 
the covariance structure for the limiting process in Theorem 14.51 

Lemma 4.11 The covariance structure for the process G in Theorem \4.5\ is 
given, for C,D E C, by 

Coy{Gc,Gd) = I {S{t)P{teOr'h{t)dt (9) 
JcnD 

+ !! -J^^^^S13Ei^il^^h(s)h(t)dsdt 
+ J kcD) s{s)s{t)p{s e opit e 



Proof: From Proposition 14.81 we have that the only term that contributes to 
the covariance is ([7]), and so iQ follows by an application of Corollary 14. 101 □ 
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5 Applications 



For all our applications, we will assume that T = [0, 1]^, and that J^t = ^{ti,t2) 
is trivial if either ti = or ^2 = 0. We recall that JF||^ = \/t2^{ti,t2) and 
J-'l^ = \/ti^{ti,t2)- Assumptions 14.21 14.31 and 14.41 will be assumed to hold 
throughout this section. 

5.1 Test of Independence 

We now concern ourselves with the construction of a test of independence of 
the components (Yi, I2) of the random vector Y E T. We will follow closely 
the development of the ideas presented in [IT] . 

As noted in [8] and [H], when Yi and Y2 are independent, the cumulative 
hazard is the product of the marginal hazards. Therefore, we have to be able 
to estimate the marginal hazards in order to obtain a test of independence. 
We will review the four examples given in §3.21 in order to illustrate how this 
is done. 

• Example 13. 7t Given ^ and the corresponding *-anti-cloud define 

:= sup{ti : (ti,0) G r} 
K2 := sup{t2 : (0,t2) e r}, 

see Figure [5l note that Ki is an jF^^^^-stopping time: for any ti G [0, 1], 
< h} = {h < = {(^1,0) G e = ^H- 

Similarly, we can show that K2 is an ^^^•'-stopping time. Taken separately, 
Yj is censored on the intervals := {kj, 1] for j = 1,2. 

• Example 13.81 This example is illustrated in Figure [31 In this case, since 
i'ij,fiij are jF^'^-stopping times, i = l,2;,j = 1,2,..., Yi is censored on the 
jF*^*)-adapted clouds Ujiyij, = 1,2. 

• Example 13.91 Referring to Figure [H taken separately, Yi is not cen- 
sored at all, since it is always possible to determine whether the individual 
is pregnant or not, but Y2 is censored on the interval {ni^i, K,2^i + c). 

• Example 13.101 In this structure, Yi is censored on the right by the 
jF*^^^-stopping time 

= inf{s: (s,0) ^O- 
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Figure 5: The stopping times Ki and K2 

Likewise, Y2 is censored on the right by the ^^^^-stopping time 

K2 = inf{M : (0,m) ^ 

In each of these examples, we end up with the same type of one and two- 
dimensional structure: the pair {Yi, Y2) is filtered on an jF-*-cloud, and each 
Yj is filtered by an jF*^-''*-*-cloud a union of random open intervals with 
endpoints that are ^^-^ ''-stopping times for j = 1,2. We would like to note 
that in classical one-dimensional problems, the intervals are usually taken to 
be left-open and right-closed in order to ensure predictability, but since we 
are assuming that P{Yg^ = 0) = 1, the endpoints of the intervals can actually 
be ignored. 

Now we are in position to define the analogous one- dimensional processes 

1=1 

€2 " 

-^2"^ (^2) = Y.hY2,i<t2}hY2,i&i2,i} 

i=l 
n 

■^n,l(^l) = E-^{^i,i>*i}'^{*ieCi,i} 
1=1 
n 

■^71,2(^2) = X]-^{^2,i>t2}-^{t2e6,i}' 
i=l 
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as well as the processes Cn,j{tj) = ^^^/^[ivj") ' (tj) - E(A^j"^ ' (tj))] and 
Dn,j{tj) = n^^^'^[Znj{tj) — E(Znj{tj))] for j = 1, 2. We will also make use of 
the one-dimensional analogues of the process M*^"^: 



Lemma 5.1 n^^^H^"^ - H,H^^^ - Hi,h!^^ - H2) =^ (G,Gi,G2) ^n the 
space /°°([0, Ti] X [0, T2]) x /°°[0, ti] x /°°[0, T2] for every r = (ti, sfic/i i/iai 
S'(r) > 0, where the vector {G,Gi,G2) is jointly Gaussian with continuous 
sample paths. 

Proof: We start very similarly to the proof of Lemma 4.1 in [11]. Recall that 



:= - EiZ^m and := n-'/^[N(-^i - i?(iV(")ij]. 



We already saw in the proof of Theorem 14.51 that C„) converges to 
(A,r) on /°°[0, If X L)[0, 1]2. We also know that [Dn,j,Gn,j) converges to a 
joint Gaussian process (Aj, Lj), j = 1, 2, on /°°[0, 1] x Z}[0, 1], since the same 
arguments for the convergence of C^) still apply. Therefore, the joint 
sequence [Dn-, -D„,i, -Dn,2, Cn, Cn,i, ^"71,2) is tight in the product of topologies of 
the space /°°[0, 1]^ x (/°°[0, l])^ x L)[o', 1]^ x (D[0, 1])^. This, coupled with the 
fact that its finite-dimensional distributions converge to those of a Gaussian 
process with continuous sample paths, give the convergence of the sequence. 
Next, note that both the estimators for the cumulative marginal haz- 

ards depend on the pairs {-^, — ), j = 1, 2, through the map ([5]), where 
the integral is defined on R instead of R^. Using the same arguments as 
in Theorem 14. 5 [ we can show that n^^'^{Hj"^ — Hj) =^ Gj in D[0,rj] for ev- 
ery Tj such that Sj{Tj) > 0, where Gj are mean- zero Gaussian processes 
for j = 1,2. As a consequence of this fact and the joint convergence of 
{Dn, Dn,i, -D„,2, Cn, Gn,i, Gn,2), we get the desired result. □ 

As was the case in pT], in order to construct our test of independence 
between (Yi^j) and (l2,i), i = 1,2,..., we will take the difference between 
the two-dimensional Nelson-Aalen estimator and the estimator under the 



Mf{-) = Nf^\-)-j 



Znj{uj)h{uj)X{duj), 



J = l,2. 



The cumulative marginal hazards are estimated by 
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hypothesis of independence. Define Vn := n^/'^{H'^'^^ — Hi^^H^^^) on [0, r], 
where r = {ti,T2). We could calculate the variance of V^, but as Pons 
remarked in p^, its practical estimation is not easy and therefore not very 
useful. That is why we prefer to use a bootstrap test based on V^, and our 
efforts will now turn towards that goal. 

We shall use the delta-method for the bootstrap, and for that we need to 
verify that the classes of functions needed are Donsker classes. Let Xi, . . . , Xn 
be a sample of random elements in a measurable space {X,A) with distri- 
bution P, let ^ be a collection of measurable functions g : X ^ "R, and let 
Pn be the empirical measure of the Xj, which induces a map from ^ to R 
defined by P^^g := 9{X,). 

Ivanoff and Merzbach observed in [9] that an anti-cloud is measurable as 
a random closed set as a consequence of part 2 of their Corollary 3.10; it 
is easy to see that the same is true of *-anti-clouds. Now assume that we 
can fully observe the pair Then we can take to be the ran- 

dom elements Xi in the previous definition. Let P be the joint distribution 
of the pair {Y-i.^i), let P„ denote its empirical measure, P„ the bootstrap 
empirical distribution and Gn the bootstrap empirical process defined by 
Gn '■= y/n{Pn — Pn)- For a more complete description of these elements, as 
well as for the exact statement of the delta-method for bootstrap in proba- 
bility -which we will use a few lines ahead- refer to Sections 3.6 and 3.9 of 
[13], respectively. 

Recalling that T = [0, 1]^, let = {g^ : t e T} and = {gf : t G 
T} be defined, respectively, by g]iYi,ii) := I{Y,(iEt}I{t<^ii^} and gtiYi,ii) : = 
Ii^Y^(zAt}I{Yi(^ii}- Then both and are Donsker classes, since both were 
shown to be convergent in distribution to tight Gaussian limits on /°°[0, 1]^ 
in the proof of Theorem 14. 5[ We can apply Theorem 2.10.6 in ^^3] to find 
that the pair {Q^,Q'^) is also a Donsker class, since each one is uniformly 
bounded. It is also clear that and have finite envelope functions be- 
cause both classes are comprised exclusively of indicator functions. Hence, 
by Theorem 3.6.1 in [13j, the conditions (3.9.9) of [I3| are met. Furthermore, 
we have shown that the map ([5]) is Hadamard-differentiable tangentially to 
C[0, 1]^ X D[0, 1]^ on a certain domain. Then we only need to apply Theorem 
3.9.11 in p3|, the delta-method for bootstrap in probability, to get the condi- 
tional convergence given [Yi, ^j) of the bootstrapped Nelson-Aalen estimator. 
Similar arguments lead to the same conclusion for the one-dimensional boot- 
strapped Nelson-Aalen estimators. 
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The above is summarized in the next lemma, which is an analogue of 
Lemma [5. II for the bootstrapped estimators. 

Lemma 5.2 Assume we can observe the pairs {Yi,^i) for i = 1,2, . . and 
let {Yi,^i) denote the bootstrap sample. Then if H^'^\ H'f'\ represent the 
bootstrapped versions of the joint and marginal Nelson- Aalen estimators, we 
have that n^/'^{H^'^^ — Hi"^ — Hi'\ H2 — -^2"^) converges conditionally 
given {Yi,^i) to the Gaussian process (G, Gi,(j*2) of Lemma \5.1\ 

Now that we have seen that the bootstrapping is working correctly under 
the assumption of complete observability of the pairs (Fj, ^j), we note that the 
bootstrapped version of the statistics only involve the product of indicator 
functions qi = I{YiGAt}I{Y,e(,} and = I^^Y,(^Et}I{^<^i^}■ Therefore, we can drop 
the assumption of complete observability, since those indicators can still be 
constructed because the events involved are observable under the survival 
model. Therefore, we can in fact bootstrap directly from the observed values 
QiyTi and come up with the same estimators. 

We can deduce from Lemma EH] that on [0, ri] x [0,r2], 

(nV2[^(«)(t,,t2) -i^{"^(ti)^f^(t2) - iH{t,,t2) - H,{t,)H2{t2m 

converges in distribution to a mean-zero Gaussian process; indeed, we can 
rewrite K = ni/2(^(") _ H^^^ H^^^) as 

hence 

^1/2 ^jj(n) _ Hin)fj{n) _ _ _ n"\H^''^ - H) 

- n'/'iHt^ - H,)H2 

- n'/'iHt^-H2)H,, 

which converges weakly to G — H2G1 — H1G2. Lemma \57I\ guarantees us that 
for the bootstrapped estimators. 



sup 

t6[0,r] 
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converges in distribution to the sup of the absolute value of that same Gaus- 
sian process. Then we can bootstrap from (gi,rj) jointly and find Cq, so that 
(1 — a) 100% of the absolute values of ffTOj) fall below Cq, which would give us 
(1 — a) 100% uniform confidence bands for the difference H — H1H2. 

Finally, for our test of independence Ho . H = H1H2 vs. Ho : H ^ H1H2, 
we reject Ho if sup^gjQ 



n 



1/2 



H(-\t„t2) - H[''\t,)H^"\t 



5.2 Test of Hazard Rate Order 

Let Xi, . . . , Xn and Yi, . . . ,Ym he independent random samples from bivari- 
ate distributions F and G respectively, and suppose that there is a common 
censoring mechanism in the form of a sequence of independent *-clouds. We 
are interested in testing whether the distributions are equal on a particular 
set or, more generally, on a suitable class of sets, against the alternative that 
there is a difference in the hazard rates. 

We start with the case where we look at the hazards on a fixed set A. The 
null hypothesis is Ho : F = G, and we can test it either against a single-sided 
alternative such as H^ < H^, or the double-sided alternative H^ ^ H^. 
Tests using a one-sided alternative leads us to tests of hazard rate order such 
as, Ho : F = G vs. Hi : < on A, since this alternative would imply 
Hj^ < H^. It is natural to consider the difference between the Nelson-Aalen 
estimators H^ and H'~' of the integrated hazards, which leads to the test 
statistic 

AN) _ (Mn)F Mm)G 



N 

where N = n + m. 

By Remark 14.91 we still have that both U^^^ = y/n{H^^^^ — H^) and 
jj(j^)G _ y^^^^^™)*^ _ ffG^ converge to independent mean-zero Gaussian 
limits and respectively. We can write 



Under the null hypothesis of equality of the cumulative hazards on the set 
A, and assuming — > A as n, m ^ 00, we get that 

The limit variable has the same distribution as under the null hypothesis. 
Then we have a test of asymptotic level a for a two-sided alternative if we 
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reject the null hypothesis whenever 



> W 



where we choose w^^'' 



such that w^'^^ = inf : P{ > t) < a|. A test with a one-sided 

alternative would be handled similarly; we just have to remove the absolute 
values. 

We can also consider testing for the hazard rate over a whole class, as long 
as it is Donsker in order to preserve the Gaussian limits. An example would 
be to take the class of rectangles A = {Az = [0, 2;] : z G [0, r]}, where [0, r] = 



[0,ri] X [0,r2] for r = (ri,r2) 
valid, but now use sup^gjo^r 
as well as choosing w^^^ - 



All the previous remarks about W^^"^ are still 
and sup^gjQ ^^e test statistics. 



W 



w 



(N) 
A, 

F . 



W 



(N) 



inf |t : P(sup 



ze[0,r] 



w 



> t) < a| and 

inf |t : P(sup2g[o^^] U^^ > t) < for two-sided and one-sided 
alternatives, respectively. 

Assuming we can fully observe the pairs (Xj, ^j), {Yj, ^j) for z = 1, . . . , n 
and j = 1, . . . ,m, we determine the appropriate critical value for w^^^ by 
bootstrapping from the pooled sample S^^^ = (Xi, ^1), . . . , (X„, 
{Yi,^[), . . . , (Ym,^'m), where we choose any of these observations with prob- 
ability l/N, as was done in Section 3.7.2 in [13]. Define J = XF -|- (1 — X)G, 
and note that under the null hypothesis, H'^ = H^. We assign the first 
n elements from the resampling to F, and the rest to G. Let if ("'^)'^ and 
jjim,N)G ^Yie Nelson-Aalen estimators for the pooled sample assigned to F 
and G respectively, and H^^^ the estimator for the complete pooled sample. 
Set 

Next, note that by Hadamard-differentiability and Theorem 3.7.6 in [13] . 
we know that (ff^"-^)^ - U( and (ff^™-^)^ - 

U2 , where f// and f// are independent mean- zero Gaussian processes with 
covariance as defined in Lemma 14.111 with S equal to the survival function 
of J and h = J'/S. Then if we let Aat = n/X, 



which converges in distribution to y/1 — Af//— -\/A[//, also a Gaussian process 
on [0, r] equal in distribution to f//. Then, as remarked in Section 3.7.2 of 
we can use 



Wat = inf < t : P{ sup W\ 



(N) 



>t) <a 
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as critical values for a one-sided test, and 

w)^ = inf it : P( sup \W^/'^\ > t) < a 

[ 2G[0,t] 

for the two-sided test. 

Once again, as we did following the statement of Lemma 15.21 we can 
see that the bootstrapped version of our test statistic only uses the func- 
tions Qi = Ilx,GAt}I{X,eC,}, Ti = Ii^x,GEt}hm.}' 1j = hYj(^At}I{Yjee,} 

r'j = I^YjGEt}^{te^'}'^ then we can safely drop the assumption of complete 
observability, given that even without it we can still construct the appropri- 
ate estimators. Thus, the test can be performed by bootstrapping directly 
from the recorded values qi,ri,qj,rj and following the procedure described 
above. 



5.3 The Hazard of a Copula 

Suppose we have a sequence (Xj, Yi) of i.i.d. bivariate random vectors with 
continuous distribution J and continuous, strictly increasing marginals F 
and G, and let H denote the integrated hazard function. Our goal is to 
estimate the hazard of the copula C associated with J. 

As usual, each pair (Xj, Yi) will be censored by an jF-*-cloud . Let 
C{p,q) = J{F~^{p),G~^{q)), the copula function for J. and let C be the 
survival function for the copula: C {p, q) = 1 — p — q + C {p, q) . It is straight- 
forward to verify that the integrated hazard of the copula, H'" , satisfies 

H'^ip, q) = H{F-\p), G-\q)\ < p, g < 1. (11) 

In other words, calculating the cumulative hazard of the copula at (p, q) is 
the same as calculating the cumulative hazard of the original distribution at 
the point G~^[q)). We would like to estimate the cumulative hazard 

of the copula, and the equation above seems to suggest that we could do so 
by looking at the Nelson- Aalen estimator for the integrated hazard of the 
distribution J evaluated at the appropriate quantiles of F and G. If F and 
G are unknown, we would replace G~^{q)) with the quantiles of the 

respective Kaplan-Meier estimates of the marginals F and G (See Section 
IV.3.1 in [IJ). 

Now, suppose for the moment that F and G are known. Define the 
pseudo-observations {Xf .Yf") := (F(Xj), G'(Fj)), so that the distribution of 
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(X*, Fj*) is the copula C. The observable region then becomes 

^* = {{Fix),Giy)):{x,y)e^,}, 

and the new filtration (now defined on [0, l]^) is J^*{p, q) = J^{F~\p), G-\q)). 
Using Equation ([3]), we have 

H^''Hp,q)= E iZ*iX*,Y*))-\ (12) 

{i:{Xf ,Y*)<ip,q), {Xf ,Y* ) GCf } 

But note that 



n 

\{Xf,Y*)&E^,^}h{n,v)&if} 



i=l 



so 



m = E (z„(x„y;))-i 

{i:(X„Y,)<(F-^{p),G-^{q))XX„Yi)&i,} 

= H^-\F-\p),G--\q)), (13) 

and hence we have an empirical analogue of ffTT]) . More generally, for D C 
[0,1]2, let (F-i,G-i)(D) := {(F-Hp),G-i(g)) : p,q E D}, in which case 
we can estimate with H^j^^^ = H^''\F~\G-^){D). The asymptotic 
normality of ^Jn{H^^^'" — iJ|^) follows from Remark I4.9[ 

If F and G are unknown, the next step is to estimate the quantiles F~^ 
and G~^. We will do so by taking the appropriate quantiles of the Kaplan- 
Meier estimators F, G of F and G respectively; then our estimator will be 
fj{n)c ^ hM(F-\G-^), where P-^p) = inf{s : F{s) > p} and G-\q) is 
defined in a similar manner. It is important to note that A^p^i^^^^ ^^i^^^^ is 

still a *-anti-cloud: indeed, since F{s) and G{t) are adapted to J-'^^\s) and 
J^^'^\t) respectively, 

(s,t)GA(^_,(p)^^_,(^)) = {s < F-\p)} n {t < G-\q)} 

= [f{u) <p, V m < s} n [G{v) <q, \/ V <t} 
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An immediate consequence of this is that our estimator still maintains a 
pseudo-strong martingale structure. Moreover, under Assumptions I4.2[ 14.41 
and 14.31 the composition map 

o G-^) (14) 

is Hadamard-differentiable tangentially to C[0, 1]^ x BVm[0, 1]^ x (C[0, 1] x 
-BVa/[0, 1])^ on a domain analogous to map ([5]): the maps 0i and 03 are 
Hadamard-differentiable as a consequence of Theorem 14.51 02 's Hadamard 
differentiability comes from Lemma 3.9.30 in [13], and 04 and 05 are Hadamard- 
differentiable by Example 3.9.24 and Lemma 3.9.25, respectively, in 
again. Hence, the process y/n{H^"'^'^ —H'^) converges in Z°°[0, r] = l°°{[0, ti] x 
[0, r2]) to a mean zero Gaussian limit for every {ri,T2) such that 

SiF-\n),G-\T2)) = Cin,T2) > 0. 




An example: The FGM copula 

We can apply the preceding results to Farlie-Gumbel-Morgenstern (FGM) 
copulas. The FGM copula with parameter 9 is defined by Cg{u,v) = uv + 
9uv{l—u){l—v). Suppose Cj^ and Cj^ are two FGM copulas with parameters 

and 9j^, respectively, and we are interested in testing Ho : Cj^ = Cj^ vs. 
Hi : < Cj^. For FGM copulas, this is equivalent to testing H'^ : 9j-^ = 9j^ 
vs. H[ : 9j^^ < 9j^. This test can be performed by 'translating' these con- 
ditions to those of a test of hazard rate order: we look for a specific set A 
under which < 9j^ is equivalent to h^-'^ {u, v) < h'^-'^ {u, v) for {u, v) G A, 
and then we can apply the results of the preceding sections. 

We have Cg{u, v) = {1 — u){l — v) + 9uv{l — u){l — v), as well as = 
1 + 9(1 - 2m) (1 - 2v). Then h^-'i{u,v) < h^J-i{u,v) becomes 

l+9j,{l-2u){l-2v) l + 9j^{l-2u){l-2v) 

{l-u){l-v) + 9j^uv{l-u){l-v) ^ {l-u){l-v) + 9j^uv{l-u){l-vy 

which is equivalent to 

{9 J, - 9j,)uv < {9 J, - 9j,){l - 2u){l - 2v) 
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after cross-multiplying and rearranging terms. Under the alternative hypoth- 
esis < Oj^, the previous inequality becomes 

uv <{1- 2u){l - 2v), 

and finally 

l-2u-2v + 3uv> 0. 

This means that, under H[ : < Oj^, we have h'-'-'2{u,v) — h'-'-'i{u,v) > 
for every {u,v) lying in the set {{u,v) : 1 — 2u — 2v + 3uv > 0}. Now in 
order to ensure that Cj. is uniformly bounded below, i = 1,2, define A : = 
[0, r] n {{u, v) : 1 — 2u — 2v + 3uv > 0} for any r = (ri, T2) with Tj < 1, 
i = 1,2. 

We consider two scenarios. First, suppose the distributions Ji and J2 have 
the same (known) marginal distributions F and G. Since Jji{h'-^'2 —h'^-'i )dX = 

H^ ''^ — Hy^ '^ , an appropriate test statistic would be (letting N = n + m) 

which would get large ii > 6j^. 

More realistically, suppose the marginal distributions of Ji and J2 are 

c c 

unknown. If Oj^ > Oj-^, then i?^^^ — Hj^^^ > for every u & A. In this 

case, an appropriate test statistic is = sup^g^ -y/^l-^i'T''*"''^ ~-^a1'""^^ )> 
which would get large if Oj^ > 6j^. We need to consider two different cases 
for calculating the critical values for our test statistic. 

If it is reasonable to assume that the marginals from the two distributions 
are equal, we proceed with the bootstrapping as indicated in Subsection 15.21 
Under Ho, we know that w'^'^ converges to the sup over A of a mean- 
zero Gaussian limit U^-^^ , and our test of level a would reject Ho whenever 
W'f^ > ivN, where wn = inf {t : P{wj^^^ > t) < a}. 

On the other hand, if the assumption of equal marginals is not justified, 
we can no longer pool the samples. In this case we will start by bootstrapping 

each sample separately. Let H^^''^''^ and ifj^™'*'""'^ denote the bootstrapped 
Nelson- Aalen estimators. Now let 

which can be rewritten as 
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Assuming that ^ A as n, m oo, the last expression converges con- 
ditionally given the original samples to v^Af/'^-^a — y/1 — \U^'^ by Lemma 
15.21 where f/'-^-'i and f/'-^-'a are the independent mean-zero Gaussian limits of 
^(^^{«)Cji -}jCj^ -) y/m{H^"^^'^-'2 ) respectively. Under the null hy- 
pothesis, W^"^ has the same limiting distribution as W^'™'^ := sup^g^ U^^'^\ 
Then we have a test of asymptotic level a if we reject the null hypothesis 
whenever > Wn,m, where Wn,m = inf {t : P(Vri"''") >t)< a\. 
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